Using an underspecified ASR system as an indicator for phonetic similarity

Authors

  • Mark Kane
  • Julie Mauclair
  • Julie Carson-Berndsen
Abstract

This paper presents a novel approach to the identification of phonetic similarity using properties observed during the speech recognition process. An experiment is presented whereby specific phones are removed during the training phase of a statistical speech recognition system, so that the behaviour of the system can be analysed to see which alternative phone is selected. The domain of the analysis is restricted to specific contexts, and the alternatively recognised (or substituted) phones are analysed with respect to three factors: the common phonetic properties, the phonetic neighbourhood, and the frequency of occurrence in the complete corpus. The results indicate that a measure of phonetic similarity based on alternatively recognised observed properties can be predicted from a combination of these factors and, as such, can serve as an important additional source of information for modelling pronunciation variation in speech recognition.
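The kind of analysis described in the abstract can be illustrated with a minimal Python sketch. It is not the authors' implementation: the feature table `PHONE_FEATURES`, the function names, and the use of Jaccard overlap as a stand-in for "common phonetic properties" are all assumptions made for illustration. The sketch takes the phones a recogniser output in place of a removed phone, and reports how often each substitute occurred alongside how many features it shares with the removed phone.

```python
from collections import Counter

# Hypothetical, simplified feature sets (an assumption, not the paper's inventory).
PHONE_FEATURES = {
    "p": {"stop", "bilabial", "voiceless"},
    "b": {"stop", "bilabial", "voiced"},
    "t": {"stop", "alveolar", "voiceless"},
    "d": {"stop", "alveolar", "voiced"},
    "s": {"fricative", "alveolar", "voiceless"},
}


def shared_feature_ratio(a, b):
    """Jaccard overlap between the feature sets of phones a and b."""
    fa, fb = PHONE_FEATURES[a], PHONE_FEATURES[b]
    return len(fa & fb) / len(fa | fb)


def substitution_profile(recognised):
    """Relative frequency of each phone recognised in place of a removed phone."""
    counts = Counter(recognised)
    total = sum(counts.values())
    if total == 0:
        return {}
    return {phone: n / total for phone, n in counts.items()}


def rank_substitutes(removed, recognised):
    """List (substitute, relative frequency, feature overlap), most frequent first."""
    profile = substitution_profile(recognised)
    rows = [
        (phone, freq, shared_feature_ratio(removed, phone))
        for phone, freq in profile.items()
    ]
    return sorted(rows, key=lambda row: row[1], reverse=True)


# Hypothetical recogniser output for contexts where "p" was removed from training.
ranking = rank_substitutes("p", ["b", "b", "t", "b"])
```

Comparing the frequency column with the overlap column shows, for a given removed phone, whether the most frequent substitutes are also the ones that share the most features with it. The other factors named in the abstract (phonetic neighbourhood and corpus frequency) are not modelled in this sketch.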

Similar resources

Underspecified Feature Models for Pronunciation Variation in ASR

In the 1990s, several studies showed that if we could just predict correctly when to include alternate pronunciations of words in ASR lexica, we could greatly reduce error rates for conversational speech tasks (i.e., Switchboard). But it is clear that the field has thus far failed to reach that potential. Many scholars model pronunciation variation via a substitution of one phonetic sequence fo...

Speech Recognition using Acoustic Landmarks and Binary Phonetic Feature Classifiers

In spite of decades of research, Automatic Speech Recognition (ASR) is far from reaching the goal of performance close to Human Speech Recognition (HSR). One of the reasons for unsatisfactory performance of the state-of-the-art ASR systems, that are based largely on Hidden Markov Models (HMMs), is the inferior acoustic modeling of low level or phonetic level linguistic information in the speech...

A measure of phonetic similarity to quantify pronunciation variation by using ASR technology

How to define a quantitative measure of phonetic similarity between IPA transcripts of the same sentence read by two speakers is a question that attracts researchers’ interest. This problem can be divided into how to align two transcripts and how to quantify the alignment gap. In this paper, we introduce a method of similarity calculation using phone-based or phoneme-based acoustic models trained with the algorith...

Automatic detection of mild cognitive impairment from spontaneous speech using ASR

Mild Cognitive Impairment (MCI), sometimes regarded as a prodromal stage of Alzheimer’s disease, is a mental disorder that is difficult to diagnose. However, recent studies reported that MCI causes slight changes in the speech of the patient. Our starting point here is a study that found acoustic correlates of MCI, but extracted the proposed features manually. Here, we automate the extraction o...

Intra-speaker variation and units in human speech perception and ASR

Research on speech perception and ASR has resulted in several important advances in our understanding of speech variation: one is that speaker-dependent variation is systematic, another is that inter-speaker and intra-speaker variation diverge in their root causes and characteristics. Therefore, a successful approach to one may not always transfer to the other. Intertalker variation, or indexical ...

Publication date: 2009